Exhibit C 



>NM_011858 ACCESSION :NM_0118 58 NID: gi 7549780 ref NM_011858.1 Mus 

musculus odd Oz/ten-m homolog 4 (Drosophila) (Odz4) , mRNA 
Length = 9722 

Score = 5272 bits (13524), Expect = 0.0 

Identities = 2498/2591 (96%), Positives = 2542/2591 (97%), Gaps = 2/2591 (0%) 
Frame = +1 

Query : 165 DHPGGLQNHARLRTPPPPLSHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEP 224 

DHP LQNH RLRTPPPPL HAHTPNQHHAAS INSLNRGNFTPRSNPS PAPTDHSLSGEP 
Sbjct: 1285 DHPSSLQNHPRLRTPPPPLPHAHTPNQHHAAS INSLNRGNFTPRSNPS PAPTDHSLSGEP 1464 

Query: 225 PAGGAQEPAHAQENWLLNSNIPLETRNLGKQPFLGTLQDNLIEMDILGASRHDGAYSDGH 284 

PAG AQEP HAQ+NW+LNS IP+ETRNLGKQPFLGT QDNLIEMDI ASR DGAYSDGH 
Sbjct: 1465 PAGSAQEPTHAQDNWVLNSKIPVETRNLGKQPFLGTWQDNLIEMDIFSASRRDGAYSDGH 1644 

Query: 285 FLFKPGGTS PLFCTTS PGYPLTS ST VT S PPPRPLPRSTFARPAFNLKKPSKYCNWKC AAL 344 

F FKPGGTS PLFCTTS PGYPLTS STVYS PP PRPLPRSTF+RPAFNLKKPSKYCNWKC AAL 
Sbjct: 1645 FFFKPGGTS PLFCTTS PGYPLTS STVYS PPPRPLPRSTFSRPAFNLKKPSKYCNWKCAAL 1824 

Query: 345 SAIVISATLVILLAYFVAMHLFGLNWHLQPMEGQM--YEITEDTASSWPVPTDVSLYPSG 402 

S AI + I SATLVILLAYFVAMHLFGLNWHLQPMEGQM YEITEDTASSWPVPTDVSLYPSG 
Sbjct: 1825 SAILISATLVILLAYFVAMHLFGLNWHLQPMEGQMQMYEITEDTASSWPVPTDVSLYPSG 2004 

Query: 403 GTGLETPDRKGKGTTEGKPSSFFPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPV 462 

GTGLETPDRKGKG EGKPSS FPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPV 
Sbjct: 2005 GTGLET PDRKGKGAAEGKP SSLFPEDSF I DSGE I DVGRRASQKI P PGTFWRSQVF I DH PV 2184 

Query: 463 HLKFNVSLGKAALVGIYGRKGLPPSHTQFDFVELLDGRRLLTQEARSLEGTPRQSRGTVP 522 

HLKFNVSLGKAALVGIYGRKGLPPSHTQ DFVELLDGRRLLTQEARSLEG RQSRG VP 
Sbjct: 2185 HLKFNVSLGKAALVGIYGRKGLPPSHTQLDFVELLDGRRLLTQEARSLEGPQRQSRGPVP 2364 

Query: 523 PSSHETGFIQYLDSGIWHLAFYNDGKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCH 582 

PSSHETGFIQYLDSGIWHLAFYNDGKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCH 
Sbjct: 2365 PSSHETGFIQYLDSGIWHLAFYNDGKESEWSFLTTAIESVDNCPSNCYGNGDCISGTCH 2544 

Query: 583 CFLGFLGPDCGRASC PVLC SGNGQYMKGRCLCHSGWKGAECDVPTNQC IDVAC SNHGTC I 642 

CFLGFLGPDCGRASCPVLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACS+HGTCI 
Sbjct: 2545 CFLGFLGPDCGRASCPVLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACSSHGTCI 2724 

Query : 643 TGTCICNPGYKGESCEEVDCMDPTCSGRGVCVRGECHCSVGWGGTNCETPRATCLDQCSG 702 

GTCICNPGYKGESCEEVDCMDPTCS RGVCVRGECHCSVGWGGTNCETPRATCLDQCSG 
Sbjct: 2725 MGTCICNPGYKGESCEEVDCMDPTCSSRGVCVRGECHCSVGWGGTNCETPRATCLDQCSG 2904 

Query: 703 HGTFL PDTGLC SCD PSWTGHDC S I E IC AADCGGHGVC VGGTC RC EDGWMGAACDQRACH P 762 

HGTFL PDTGLC +CD PSWTGHDC S I E IC AADCGGHGVC VGGTC RC EDGWMGAACDQRACH P 
Sbjct: 2905 HGTFLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHP 3084 

Query: 763 RC AEHGTC RDGKC EC S PGWNGEHCT I EGC PGLCNGNGRCTLDLNGWHC VC QLGWRGAGCD 822 

RC AEHGTC RDGKC EC + PGWNGEHCT I EGC PGLCNGNGRC TLDLNGWHC VC QLGWRG GC D 
Sbjct: 3085 RCAEHGTCRDGKCECT PGWNGEHCT I EGC PGLCNGNGRC TLDLNGWHC VCQLGWRGTGCD 3264 

Query: 823 TSMETACGDSKDNDGDGLVDCMDPDCCLQPLCHINPLCLGSPNPLDIIQETQVPVSQQNL 882 

TSMET CGD KDNDGDGLVDCMDPDCCLQPLCH+NPLCLGSP+ PLDI IQETQ PVSQQNL 
Sbjct: 3265 TSMETGCGDGKDNDGDGLVDCMDPDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNL 3444 



Query: 


883 


Sbjct : 


3445 


Query: 


943 


Sbjct: 


3625 


Query: 


1003 


Sbjct : 


3805 


Query: 


1063 


Sbjct: 


3985 


Query: 


1123 


Sbjct: 


4165 


Query: 


1183 


Sbjct: 


4345 


Query: 


1243 


Sbjct: 


4525 


Query: 


1303 


Sbjct: 


4705 


Query: 


1363 


Sbjct : 


4885 


Query : 


1423 


Sbjct: 


5065 


Query : 


1483 


Sbjct: 


5245 


Query: 


1543 


Sbjct: 


5425 


Query : 


1603 


Sbjct: 


5605 


Query : 


1663 


Sbjct: 


5785 



+ S F YDRI KFLVGRDSTH IPGENPFDGGHACVIRGQVMTSDGTPLVGVNISF+NNPLFGY 



TISRQDGSFDLVTNGGISIILRFERAPFITQEHTLWLPWDRFFVMETI+MRHEENEIPS 



DLSNFARPNPWSPSPLTSFASSCAEKGPIVPEIQALQEEI I+GCKMRLSYLSSRTPGY 



KSV+RISLTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGL 



S EAFVSVGYE YE SC PDL I LWEKRT VLQGYE I DAS KLGGWSLDKHHALNI QSG I LHKGNG 



ENQFVSQQPPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYI 



RRIFPSGNVTNILE+SHSPAHKYYLATDPMSGAVFLSD+NSRRVFK+KST WKDLVKNS 



EWAGTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRR+DQNGII 



STLLGSNDLTSARPLSCDSVM+ISQV LEWPTDLAINPMDNSLYVLDNNWLQISENHQV 



RIVAGRPMHCQVPGID FLL SKVAI HATLE S ATALAVSHNGVL Y I AETDEKKINRI RQVT 



TSGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNI 



RI RF I RKNKPFLNTQNMYELS S P I DQELYLFDT+ GKHL YTQSLPTGDYL YNFTYTGDGDI 



T ITDNNGNMVNVRRDSTGMPLWLWPDGQVYWVTMGTNSAL+ SVTTQGHELAMMTYHGN 



SGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTI 



Query: 1723 TTNLSASGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKR 1782 

TT+LS SGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKR 
Sbjct: 5965 TTHLSGSGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKR 6144 

Query: 1783 NVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRK 1842 

NVTL P I DNGLNLVEWRQRKEQARGQVTVFG RLRVHNRNLLSLDFDRVTRTEKIYDDHRK 
Sbjct: 6145 NVTLPIDNGLNLVEWRQRKEQARGQVWFGPRLRVHNRNLLSLDFDRVTRTEKIYDDHRK 6324 

Query: 1843 FTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGYIAGIQRGIMSERMEYDQAGRITSRIF 1902 

FTLRILYDQAGRPS WSPSSRLNGVNVTYSPGG+IAGIQRGIMSERMEYDQAGRITSRIF 
Sbjct: 6325 FTLRILYDQAGRPSFWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRITSRIF 6504 

Query: 1903 ADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNI 1962 

ADGK WSYTYLEKSMVL LHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNI 
Sbjct: 6505 ADGKMWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNI 6684 

Query: 1963 YQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETA 2022 

YQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETA 
Sbjct: 6685 YQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETA 6864 

Query: 2023 GMLKTINLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVI 2082 

GMLKT+NLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVI 
Sbjct: 6865 GMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVI 7044 

Query: 2083 NET PL P I DL YRYDDVSGKTEQFGKFGVI YYD INQ 1 1 TTAVMTHTKHFDAYGRMKEVQ YE I 2142 

NET PL P I DL YRYDDVSGKTEQFGKFGVI Y YDINQ 1 1 TTAVMTH + KHFDAYGRMKEVQ YE 
Sbjct: 7045 NET PL P I DL YRYDDVSGKTEQFGKFGVI YYD INQ 1 1 TTAVMTH S KHFDAYGRMKEVQ YET 7224 

Query: 2143 FRSLMYWMTVQYDNMGRWKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYD 2202 

FRSLMYTOTVQYDNMGRWKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYD 
Sbjct: 7225 FRSLMYWMTVQYDNMGRWKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYD 7404 

Query: 2203 LNGNLHLLSPGNSARLTPLRYDIRDRITRLGDVQYKMDEDGFLRQRGGDIFEYNSAGLLI 2262 

LNGNLHLLS PGNS ARLT PLRYD + RDRI TRLGDVQYKMDEDG LRQRGGD+FEYNSAGLLI 
Sbjct: 7405 LNGNLHLLSPGNSARLTPLRYDLRDRITRLGDVQYKMDEDGSLRQRGGDVFEYNSAGLLI 7584 

Query: 2263 KAYNRAGSWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLY 2322 

KAYNRA WSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLY+HSSSEITSLY 
Sbjct: 7585 KAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYSHSSSEITSLY 7764 

Query: 2323 YDLQGHLFAMELS SGDEF Y I ACDNI GTPLAVF SGTGLMI KQ I L YTAYGE I YMDTNPNFQ I 2382 

YDLQGHLFAMELS SGDEF Y I ACDNI GTPLAVF SGTGLMI KQ I L YTAYGE I YMDTNPNFQ I ' 
Sbjct: 7765 YDLQGHLFAMELS SGDEF Y I ACDNI GT PLAVF SGTGLMI KQ IL YTAYGE I YMDTNPNFQ I 7944 

Query: 2383 IIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWKHLSSSNVMPFNLYMFKNNNPI 2442 

I IGYHGGLYDPLTKLVHMGRRDYDVLAGRWTS PDHELWK LSS++++PF+LYMFKNNNPI 
Sbjct: 7945 IIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWKRLSSNSIVPFHLYMFKNNNPI 8124 

Query: 2443 SNSQD I KC FMTDVNSWLLTFGFQLHNVI PGYPKPDMDAMEP S YEL I HTQMKTQEWDNS KS 2502 

SNSQDIKCFMTDVNSWLLTFGFQLHNVIPGYPKPD DAMEPSYEL+ HTQMKTQEWDNS KS 
Sbjct: 8125 SNSQDIKCFMTDVNSWLLTFGFQLHNVIPGYPKPDTDAMEPSYELVHTQMKTQEWDNSKS 8304 

Query: 2503 ILGVQCEVQKQLKAFVTLERFDQLYGSTITSCQQAPKTKKFASSGSVFGKGVKFALKDGR 2562 

I LGVQC EVQKQLKAF VTLERFDQL YGST I TSC QQAP +TKKF AS SG S + FGKGVKF ALKDGR 
Sbjct: 8305 ILGVQCEVQKQLKAFVTLERFDQLYGSTITSCQQAPETKKFASSGSIFGKGVKFALKDGR 8484 



Query : 


2563 


VTTD I I S VANEDGRRVAA I LNHAH YL ENLHFT I DGVDTH YF VK PG P S EGDL AI LGL S GGR 


2622 






VTTDIISVANEDGRR+AAILN+AHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGR 




Sbjct : 


8485 


VTTDI I SVANEDGRRIAAILNNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGR 


8664 


Query : 


2623 


RTLENGVNVTVSQINTVLNGRTRRYTDIQLQYGALCLNTRYGTTLDEEKARVLELARQRA 


2682 






RTLENGVNVTVSQINT+L+GRTRRYTDIQLQY ALCLNTRYGTT+DEEK RVLELARQRA 




Sbjct : 


8665 


RTLENGVNVTVSQINTMLSGRTRRYTDIQLQYRALCLNTRYGTTVDEEKVRVLELARQRA 


8844 


Query : 


2683 


VRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNI 


2742 






VRQAWAREQQRLREGEEGERAWT+GEKQQVL+TGRVQGYDGFFV SVEQYPELSDSANNI 




Sbjct : 


8845 


VRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNI 


9024 


Query : 


2743 


HFMRQSEMGRR 2753 








HFMRQSEMGRR 




Sbjct: 


9025 


HFMRQSEMGRR 9057 
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□ 1: NM 011858 , Mus musculus odd ...[gi:7549780] Links 

LOCUS NM_011858 9722 bp mRNA linear ROD 06-OCT-2003 

DEFINITION Mus musculus odd Oz/ten-m homolog 4 (Drosophila) (Odz4) , mRNA. 

ACCESSION NM_011858 

VERSION NM_011858.1 GI: 7549780 

KEYWORDS 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 9722) 

AUTHORS Ben-Zur,T., Feige,E., Motro,B. andWides,R. 

TITLE The mammalian Odz gene family: homologs of a Drosophila pair-rule 

gene with expression implying distinct yet overlapping 
developmental roles 
JOURNAL Dev. Biol. 217 (1), 107-120 (2000) 
MEDLINE 20092737 
PUBMED 10625539 
REFERENCE 2 (bases 1 to 9722) 

AUTHORS Wang,X.Z., Kuroda,M., Sok,J., Batchvarova, N. , Kimmel,R., Chung,?., 

Zinszner,H. and Ron , D . 
TITLE Identification of novel stress-induced genes downstream of chop 

JOURNAL EMBO J. 17 (13), 3619-3630 (1998) 
MEDLINE 98315054 
PUBMED 9649432 

COMMENT PROVISIONAL REFSEQ : This record has not yet been subject to final 

NCBI review. The reference sequence was derived from AF059485 . 1 . 
FEATURES Location/Qualifiers 
source 1. .9722 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="NIH Swiss" 
/ db_xref = " taxon : 1 0 0 9 0 " 
/ chromosome= " 7 " 
/map="7 46.5 cM" 
gene 1 . . 9722 

/gene=" Odz 4" 

/note=" synonyms: Doc4, Ten-m4" 
/ db_xre f = " GenelD : 2 3 9 6 6 " 
/ db_xr ef = " Locus ID : 2 3 9 6 6 " 
/ db_xr e f = " MG I : 2447063 " 
CDS 583. .9060 

/gene=" Odz 4" 

/note="type II transmembrane protein" 
/codon_start=l 

/product="odd Oz/ten-m homolog 4" 
/protein_id=" NP_035988 .1 " 
/db_xref="GI: 7549781" 



http://www.ncbi.nlm.nih.gov/entrez/query 10/8/2003 
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/db_xref="GeneID: 23966" 
/ db_xre f =■ Locus ID : 23966 " 
/ db_xr e f = " MG 1 : 2447063 " 

/ translation= " MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKGPQKSYSSSET 
LKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEMTPPHGTLYRTDIGL 
PHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHE 
NTETGAPLHCSSASSTPIEQSPSPPPSPPANESQRRLLGNGVAQPTPDSDSEEEFVPN 
SFLVKSGSASLGVAANDHPSSLQNHPRLRTPPPPLPHAHTPNQHHAASINSLNRGNFT 
PRSNPSPAPTDHSLSGEPPAGSAQEPTHAQDNWVLNSKIPVETRHLGKQPFLGTWQDN 
LIEMDIFSASRRDGAYSDGHFFFKPGGTSPLFCTTSPGYPLTSSTVYSPPPRPLPRST 
F SRP AFNLKKP S KYCNWKC AAL S AI L I S ATLVI LLAYFVAMHLFGLNWHLQ PMEGQMQ 
MYEITEDTASSWPVPTDVSLYPSGGTGLETPDRKGKGAAEGKPSSLFPEDSFIDSGEI 
DVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGIYGRKGLPPSHTQLDFV 
ELLDGRRLLTQEARSLEGPQRQSRGPVPPSSHETGFIQYLDSGIWHLAFYNDGKESEV 
VSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCPVLCSGNGQYMKG 
RCLCHSGWKGAECDVPTNQCIDVACSSHGTCIMGTCICNPGYKGESCEEVDCMDPTCS 
SRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCNCDPSWTGHDCSI 
EICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECTPGWNGE 
HCTIEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGTGCDTSMETGCGDGKDNDGDGLVD 
CMDPDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNLNSFYDRIKFLVGRDSTHS 
IPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFINNPLFGYTISRQDGSFDLVTNGG 
ISIILRFERAPFITQEHTLWLPWDRFFVMETIVMRHEENEIPSRDLSNFARPNPWSP 
SPLTSFASSCAEKGPIVPEIQALQEEIVIAGCKMRLSYLSSRTPGYKSWRISLTHPT 
IPFNLMKWLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGLSEAFVSVGYE 
YESCPDLILWEKRTAVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQQ 
PPVIGS IMGNGRRRS I SC PSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYI RRI FPS 
GWTNILEMSHSPAHKYYLATDPMSGAVFLSDTNSRRVFKVKSTTWKDLVKNSEWA 
GTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRVDQNGI I ST 
LLGSISTOLTSARPLSCDSVMEISQVRLEWPTDLAINPMDNSLYVLDNNVVLQISENHQV 
RIVAGRPMHCQVPGIDQFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQ 
VTTSGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVAD 
LGNIRIRFIRKNKPFLNTQNMYELSSPIDQELYLFDTSGKHLYTQSLPTGDYLYNFTY 
TGDGDITHITDNNGNMVWRRDSTGMPLWLWPDGQVYWTMGTNSALRSVTT^ 
AMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQV 
ETSSKDDVTITTHLSGSGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHL 
LAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGPRLRVHNRNLLSLDFD 
RVTRTEKIYDDHRKFTLRILYDQAGRPSFWSPSSRLNGVNVTYSPGGHIAGIQRGIMS 
ERMEYDQAGRITSRIFADGKMWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPN 
VARQTLETIRSVGYYRNIYQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLS 
KLAETL YDTTKVS FTYDETAGMLKTVNLQNEGFTCT I RYRQ I G PL IDRQ I FRFTEEGM 
VNARFDYNYDNSFRVTSMQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQII 
TTAVMTHSKHFDAYGRMKEVQYETFRSLMYWMTVQYDNMGRWKKELKVGPYANTTRY 
SYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSARLTPLRYDLRDRITRLGD 
VQYKMDEDGSLRQRGGDVFEYNSAGLLIKAYNRASGWSVRYRYDGLGRRVSSKSSHSH 
HLQFFYADLTNPTKVTHLYSHSSSEITSLYYDLQGHLFAMELSSGDEFYIACDNIGTP 
L AVF S GTGLMI KQ I L YT A YG E I YMDTNPNFQ III G YHGGL YD PLTKL VHMGRRD YDVL 
AGRWTS PDHELWKRLS SNS I VPFHL YMFKNNNPI SNSQDI KCFMTDVNSWLLTFGFQL 
HNVI PGYPKPDTDAMEPS YELVHTQMKTQEWDNSKS ILGVQCEVQKQLKAFVTLERFD 
QLYGSTITSCQQAPETKKFASSGSIFGKGVKFALKDGRVTTDIISVANEDGRRIAAIL 
NNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGRRTLENGVNVTVSQINTML 
SGRTRRYTDIQLQYRALCLNTRYGTTVDEEKVRVLELARQRAVRQAWAREQQRLREGE 
EGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMRQSEMGRR" 
misc_feature 1813.. 1881 

/gene=" 0dz4 " 

/note= "putative; transmembrane-region site" 
misc_f eature 5668.. 7644 

/gene=" 0dz4" 

/note="RhsA; Region: Rhs family protein [Cell envelope 
biogenesis, outer membrane]" 
/db_xref = " CDD : COG3209 " 



http://www.ncbi.nlm.nih.gOv/entrez./query.fcgi ?cmd=Retrieve&db=nucleotide&list_uids=7549... 10/8/2003 
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BASE COUNT 
ORIGIN 

1 



2350 a 2747 c 2517 g 



gaattcggca cgagagcccc gaatctggac 
61 cattctttct ctgtgcttcc ttttctcttc 
121 gactgacgcg tccctactca cccctgggac 
181 atttcaggtt cccttggcaa ccccggtgcc 
241 cctgagaaga cctgacttct ggatggacag 
301 cggacacttg ggcagtagaa cccataccac 
361 agggggctgc agggcccgaa gacaccaact 
421 nactgaaatt accatgtgtc caaattaaaa 
481 tattcttaga cccttttaag agaagattta 
541 agggcccaat tcgtggatgt ggaagagcga 
601 aagccctatc gctcgctgac ccggcgtcgc 
661 gcagacagcg aggagggcaa gggcccacag 
721 gcatatgacc aagatgctcg cctagcctac 
781 gaggccgagg agttctgccg cacaggcact 
841 gagatgacgc cccctcatgg gactctctac 
901 tattccatgg gggccagctc tgatgcagac 
961 cacccagtgc gcctgtgggg ccggagcaca 
1021 cgggccaact ccaacctcac actcacggac 
1081 cccttgcatt gttcatctgc ttcatcaacc 
1141 tcccctccgg ccaatgagag ccagaggcgg 
1201 ccggactcag actctgagga agagtttgtc 
1261 gccagcctgg gggtcgcagc aaacgatcat 
1321 cggacgcctc ctccaccact gccccacgcc 
1381 atcaactcct tgaacagggg caacttcacc 
1441 gaccactcac tctctgggga gcccccagca 
1501 gacaactggg tactcaacag taaaatccca 
1561 ttcctaggga catggcagga caacctcatt 
1621 gatggggctt acagtgacgg gcacttctty 
1681 tgcaccacat ccccagggta ccccctaacg 
1741 cccctgcccc gcagcacttt ctcccgacca 
1801 tgcaactgga agtgtgcagc gctgagcgcc 
1861 ctcgcatact ttgtggccat gcacctgttt 
1921 gggcagatgc agatgtatga gatcacggag 
1981 gacgtctccc tgtatccctc agggggcact 
2041 ggagccgcag aaggaaagcc aagtagttta 
2101 gaaatcgatg tggggagacg ggcttcccag 
2161 caggtgttca tagaccaccc tgtacacctg 
2221 ctggtcggca tttatggcag aaaaggcctt 
2281 gagctcctgg atggaagaag gctcctaacc 
2341 cgccaatcac ggggccctgt tcccccatcc 
2401 gattcaggaa tctggcacct ggccttttac 
2461 tttctcacca ccgccattga atcagtggat 
2521 gactgcattt ccgggacctg ccactgcttc 
2581 gcatcctgtc ccgtgctgtg cagtgggaac 
2641 cacagtggct ggaagggtgc cgaatgtgac 
2701 tgcagcagtc atggcacctg catcatgggc 
2761 gagagctgcg aggaagtgga ctgcatggac 
2 821 agaggcgagt gccactgctc tgtgggatgg 
2881 acatgcttgg accagtgctc aggccacgga 
2941 tgtgacccaa gctggactgg acacgactgt 
3001 ggccatggcg tctgcgtagg aggcacctgc 
3061 tgcgaccaac gggcctgcca cccacgctgt 
3121 tgcgaatgca cccccggctg gaatggagag 
3181 tgcaatggaa atggcagatg taccctggac 
3241 ggctggcgag ggactggctg cgacacatcc 
33 01 aacgacggag atggcttggt ggactgcatg 
3361 catgtcaacc cgctgtgcct aggctcccct 
3421 gcccctgtat cccagcagaa cctgaactcc 



2079 t 

tcctccactt 
ctcagactct 
gtaattcctg 
ctgtctcctg 
tctgaggctg 
cctgttctct 
cttcccagag 
ttgcgtactt 
aagaaaaacc 
gagccagagg 
gatgctgagc 
aagtcctaca 
ggcagccgcg 
aatttcactc 
aggacagaca 
ctggaagcag 
cggtcagggc 
acagagcatg 
cctattgagc 
ttgctaggca 
cctaattcat 
cccagcagcc 
cataccccca 
ccaaggagca 
ggcagcgccc 
gtggaaacca 
gagatggaca 
ttcaagcccg 
tctagcaccg 
gcctttaacc 
atcctcatct 
ggcctaaact 
gacacagcca 
ggcttagaga 
tttccagagg 
aagattccgc 
aagttcaatg 
cctccttccc 
caagaggcaa 
agccatgaga 
aatgatggaa 
aactgcccca 
ytgggttttc 
ggccagtaca 
gtgcctacca 
acctgcatct 
cccacatgtt 
ggaggcacca 
accttcctcc 
tccattgaga 
cgctgtgagg 
gcagaacacg 
cactgcacca 
ctgaatgggt 
atggaaacgg 
gaccctgact 
gaccccctgg 
ttctacgatc 



29 others 

gacaccgggc 
gaacttcagc 
caggattgtg 
agaactgtgc 
agagagggct 
gggatccgat 
cgctggcatg 
caagggatta 
actcggccct 
ccatggacgt 
gccgctacac 
gctccagtga 
tcaaggacat 
ttcgtgaact 
tcggcctccc 
acactgtgct 
gcagttcctg 
agaacacaga 
agtccccctc 
acggtgtggc 
tcttagttaa 
tgcagaacca 
accagcatca 
accccagccc 
aggagccaac 
gaaacctagg 
ttttcagcgc 
gaggcacctc 
tgtattcgcc 
tcaagaagcc 
cagctacgct 
ggcacctgca 
gcagctggcc 
cccctgacag 
acagttttat 
ctggaacgtt 
tgtctctggg 
atactcagtt 
ggagcctgga 
ccggtttcat 
aagaatctga 
gcaactgcta 
tgggccctga 
tgaagggcag 
accagtgtat 
gcaaccccgg 
ccagcagggg 
actgcgagac 
cagacacagg 
tctgtgccgc 
atggttggat 
ggacctgccg 
tcgagggctg 
ggcactgtgt 
gctgtggaga 
gctgcctcca 
acatcatcca 
gaatcaagtt 



cactaagaga 
agttttgaat 
gcaaactagt 
atccccagga 
agtgatttct 
ggccttggag 
gagccaggta 
tttgaaggac 
gagtgcggca 
gaaggagagg 
cagctcatcg 
gaccctgaag 
ggtaccacag 
gggactggga 
acactgtggc 
atcccctgaa 
cctgtccagc 
aaccggtgct 
cccgcccccc 
ccagccaacc 
aagtggctcc 
ccctcggctc 
cgcggcctcc 
agcacccaca 
ccatgcccag 
caagcagcca 
ctcccgccgt 
cccgctcttc 
cccaccccgg 
ttccaagtac 
cgtcatcctg 
gccgatggag 
tgtgccaacg 
gaaaggcaaa 
agattctgga 
ctggagatct 
gaaagcagct 
agactttgtg 
gggtcctcag 
ccagtatctg 
agtggtttca 
tggtaacggt 
ctgtggccga 
gtgcctgtgc 
cgacgtggcc 
ctacaagggc 
tgtntgtgtc 
acccagggcc 
actttgcaac 
tgactgtggt 

gggggccgca 

ggacggcaag 
tcctggcttg 
ctgccagctg 
cggcaaggac 
gcctctctgt 
agagacacag 
cctcgtgggc 
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3481 agggacagca cacacagcat ccccggggag aacccctttg acggagggca tgcttgtgtc 

3541 atccgtggac aagtgatgac gtcagatggg accccattgg ttggcgtgaa catcagtttc 

3601 atcaataacc ctctctttgg atacacaatc agcaggcaag atggcagctt tgacctggtc 

3661 acaaatggcg gcatctccat catcttgaga ttcgaaaggg cacccttcat cacacaggag 

3721 cataccctct ggctaccctg ggatcgcttc tttgtcatgg aaaccatcgt catgagacac 

37 81 gaggagaatg agatccccag cagagacctg agcaactttg cccgtcccaa ccctgtggtc 

3841 tctccatccc cactgacatc cttcgccagc tcctgtgctg agaaaggccc cattgtccca 

3901 gaaatccagg ccctgcaaga ggaaatcgtc atcgctggct gcaagatgag gttgagctac 

3961 ctgagcagcc gcactcctgg ctataagtct gtcgtgagga tcagtctcac acaccccacc 

4021 atccccttca acctcatgaa ggtccacctc atggtggcag tcgagggtcg gctcttcagg 

4081 aagtggtttg ccgcagctcc tgacctgtcc tattacttca tctgggacaa gacagatgtc 

4141 tacaaccaga aggtgtttgg actctcagaa gcctttgttt ccgtgggtta tgagtatgaa 

4201 tcctgcccag atctgatcct gtgggagaaa aggacagcgg tgctgcaggg ctatgaaatt 

4261 gatgcctcca agctgggggg ctggagtctg gataagcacc atgccctgaa catccagagt 

4321 ggcatcctgc acaaagggaa tggagagaac cagtttgtgt cccagcagcc accggtcatc 

4381 ggcagcatca tgggcaatgg gcgcaggaga agtatctcct gccccagctg caatggtctt 

4441 gctgatggca acaagctcct ggcccccgtg gccctcacct gcggctctga tgggagtctc 

4501 tacgtgggag acttcaacta catccgcaga atcttcccct ctggaaatgt caccaacatc 

4561 ctggagatga gtcacagccc agcacacaaa tactacctgg ctacagaccc catgagtggg 

4621 gccgtcttcc tgtctgacac caacagccgg cgggtcttca aggtcaagtc caccacagtg 

4681 gtgaaggacc tggtgaagaa ctccgaggtg gtagcaggga ctggtgacca gtgcctcccc 

4741 tttgatgata cccgctgcgg agatggtggg aaagccacag aagccacgct cactaacccc 

4801 aggggaatta cagtggacaa gtttgggctc atttatttcg tggacggcac catgatcaga 

4861 cgtgttgatc aaaatggaat catctccact ttgctgggct ccaatgacct cacctcggcc 

4921 aggcccctca gctgtgactc cgtcatggag atttctcagg ttcgcctgga gtggcccaca 

4981 gacttagcca tcaacccaat ggataattct ctctatgtcc tcgacaacaa tgtggtcctg 

5041 caaatctccg aaaaccacca ggtacgcatt gtcgctggga ggcccatgca ctgccaggtc 

5101 cctggcatcg accaattcct gctgagcaag gtagccatcc atgcgaccct ggagtcagcc 

5161 actgctctgg ccgtttcaca caatggagtc ttgtacatcg ctgagaccga cgagaaaaag 

5221 atcaatcgca tcaggcaggt cactacaagt ggtgagatct cactggttgc tggtgccccc 

5281 agtggctgtg actgtaaaaa tgatgccaac tgtgactgct tctctggaga tgatggttac 

5341 gccaaggatg caaagctgaa taccccatcg tccttggctg tgtgtgctga cggggagctc 

5401 tatgtggccg acctgggaaa catccgaatt cgatttatcc ggaagaacaa gcctttcctg 

5461 aacactcaga acatgtacga gctatcctcc cccatcgacc aggagctgta cctctttgat 

5521 accagtggca agcatctgta cactcagagc ctacccacag gggactacct gtacaacttc 

5581 acttacacag gggacgggga catcacacat atcaccgaca acaatggcaa catggtgaac 

5641 gtccgccgag actctaccgg gatgcctctc tggctggtag tcccagatgg ccaggtatac 

5701 tgggtaacca tgggcaccaa cagcgcactc agaagtgtga ccacacaagg acacgagcta 

5761 gccatgatga cctaccatgg caactctggc ctcttggcaa ccaaaagcaa tgaaaacggg 

5821 tggacaacgt tttatgagta tgacagtttt ggtcgcctga caaacgtgac ctttccaact 

5881 ggccaggtga gcagtttccg aagcgataca gacagctcag tgcacgtgca ggtagagacc 

5941 tcaagcaaag atgacgtcac cataaccacc cacctgtctg gttcgggtgc cttctacacc 

6001 ctgttacaag accaagtccg gaacagctac tacatcgggg ctgatggctc cctgaggctg 

6061 ctactagcca atggcatgga agtggctytg cagactgagc cacacctgct ggctggcact 

6121 gtcaacccca ctgtaggcaa gaggaatgtc acactgccca ttgataatgg cctcaacctg 

6181 gtggagtggc ggcagcgcaa ggagcaggct cgtggccagg tcaccgtctt tggaccccgt 

6241 ytgcgggttc acaaccgaaa cctcttgtct ttggactttg accgtgtcac acgcacagag 

6301 aagatctacg atgaccatcg caagttcacc cttcggatcc tatatgacca ggcagggagg 

6361 cccagcttct ggtcacctag cagcaggctg aatggtgtta atgtgaccta ctcccctgga 

6421 ggtcacattg ctggaatcca aaggggcatc atgtctgaga gaatggaata tgatcaggca 

6481 gggcgcatca catcccggat ctttgcagac gggaaaatgt ggagctacac gtacttagag 

6541 aagtccatgg tgcttcatct ccacagccag aggcagtaca tcttcgagtt tgacaagaat 

6601 gaccgcctct cttctgtgac catgcccaac gtagcccggc agacgctgga gaccatccgc 

6661 tcagtgggct actacaggaa catctaccag cccccggaag gcaacgcctc agtcattcag 

6721 gacttcactg aggatggaca cctgttacat accttctact tgggcaccgg ccgccgggtg 

6781 atttacaagt atggcaagtt gtcaaagctg gccgagactc tgtatgacac cactaaggta 

6841 agcttcacct acgacgagac cgcagggatg ctgaagactg tcaacctaca gaatgagggc 

6901 ttcacctgca ctatccgcta ccgtcagatt gggcccctga ttgatcggca gatcttccgt 

6961 ttcacggaag aaggcatggt caatgcccgt tttgattaca actatgacaa cagtttccgt 

7021 gtgactagca tgcaggctgt gatcaatgag accccactgc ccattgacct ctaccgctat 
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7081 
7141 
7201 
7261 
7321 
7381 
7441 
7501 
7561 
7621 
7681 
7741 
7801 
7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 



gatgatgtgt 
aaccagatca 
atgaaggaag 
gataacatgg 
cgctactcct 
ccactctggc 
agcgcacggc 
gtacagtaca 
tacaattcag 
taccgatacg 
cagttttttt 
agctctgaga 
agcagtggcg 
agtggaactg 
gacacaaacc 
accaagcttg 
ccagaccatg 
atgtttaaga 
gtcaacagct 
aagccagaca 
caggaatggg 
aaggctttcg 
caacaggccc 
aagtttgcct 
gggcggagga 
attgatgggg 
atcctgggcc 
cagatcaaca 
agggcgctgt 
ctggagctgg 
cgggaagggg 
acggggcggg 
ctgtcagaca 
cagaaagggc 
ctgactgtgt 
aaaacnaaaa 
gttaattcat 
tcacactttt 
tgttttgttt 
ttttctcctt 
gcatctcctt 
tntctgtgct 
ttaagggtcc 
attgtgaatt 
cc 



cagggaagac 
ttaccacagc 
tacagtatga 
gacgggtagt 
atgagtatga 
gctacagcta 
tcacaccact 
agatggatga 
ccgggctgct 
atgggctggg 
acgcagacct 
ttacatccct 
atgagtttta 
gcttgatgat 
ccaacttcca 
tccacatggg 
aactctggaa 
acaacaaccc 
ggctcctcac 
cagatgccat 
acaacagcaa 
ttaccttaga 
ctgagacaaa 
tgaaagatgg 
tcgcagccat 
tggacaccca 
tcagtggggg 
ccatgctcag 
gcctgaacac 
ccaggcagag 
aagagggcct 
tgcaaggcta 
gcgccaacaa 
caaggcctgg 
tgtacttaac 
cttcttttta 
gcaacatcct 
ttttttttgc 
tgttttctgt 
gangaatttt 
gggtntntcc 
aacttgtctc 
aaaatcncan 
ctatccccat 



agagcagttt 
cgtcatgacc 
gactttccgg 
gaagaaggag 
tgctgatggc 
tgacctcaat 
acggtatgac 
ggatggctcc 
catcaaagcc 
acgccgagta 
gaccaacccc 
ctactatgat 
catagcttgt 
taagcagatc 
gatcatcatc 
ccgacgggat 
acgcctgagt 
catcagcaac 
ctttggattc 
ggaaccctcc 
gtctatcctc 
acgctttgac 
gaagtttgcc 
tcgagtgacc 
cttgaacaat 
ctactttgtg 
gcggagaacc 
tggcaggact 
ccgctacggg 
agctgtgcgc 
gcgggcctgg 
cgacggcttc 
tatccacttc 
acttcttgcc 
aaaaaacaaa 
acaagtgcag 
tttttttttt 
aaaaaatgga 
antgtgatca 
ccctgtcgtt 
acatccgccc 
gtgtgtaacc 
ggcccgtggg 
tttttgttcc 



gggaagtttg 
cactccaagc 
tcactcatgt 
ctgaaggtgg 
cagctgcaga 
gggaacctac 
ctccgtgacc 
ctgaggcagc 
tacaatcggg 
tccagcaaga 
accaaggtca 
ctgcaaggac 
gacaacatcg 
ctatacacgg 
ggctaccacg 
tatgatgtgc 
agcaacagca 
tctcaggaca 
cagctgcaca 
tacgagctcg 
ggggtacagt 
cagctctacg 
tccagtggtt 
actgacatca 
gctcactaty 
aaaccaggac 
ctggagaatg 
agacgctaca 
acaacagtgg 
caggcttggg 
acagatgggg 
tttgtgacct 
atgagacaga 
aaagacagct 
caaaaaaaca 
aaaacagaac 
tttaaanaaa 
agggtttttt 
tnaaaanaat 
gangatacat 
ctangtnccc 
ctttcttctc 
aacttgggaa 
cctgaagaac 



gtgtcattta 
actttgatgc 
actggatgac 
gaccctatgc 
cagtctccat 
acttgctgag 
gcatcactag 
ggggtgggga 
ctagtgggtg 
gcagccacag 
cccacctcta 
acctctttgc 
ggaccccgct 
cctatgggga 
gcggcctcta 
tggctggacg 
tcgtgccttt 
tcaagtgctt 
acgtgatacc 
tacacacaca 
gtgaagttca 
gctcgaccat 
ccatctttgg 
tcagtgtggc 
tagagaacct 
cttcggaagg 
gggtcaacgt 
ctgacatcca 
atgaggaaaa 
cccgagagca 
aaaagcagca 
cggtcgagca 
gcgagatggg 
actcttctgt 
caaaaaaaac 
agaaagatat 
naaaaacaca 
tgttttgttt 
ttcattgtct 
ttctcctctc 
agtgtgatnt 
tggctggaat 
cctngttagg 
tacatcatgc 



ctacgacatc 
ttatggcagg 
tgttcagtat 
caacactacc 
caatgacaag 
ccctgggaac 
gctgggtgat 
tgtcttcgag 
gagtgtcagg 
ccaccacctg 
cagccactcc 
catggagctg 
tgctgtcttc 
gatctacatg 
tgatccactc 
ctggacaagc 
tcatctctac 
catgacagat 
tggctatccc 
gatgaaaact 
gaagcaactc 
caccagctgc 
caagggggtc 
caatgaggat 
tcacttcacc 
tgacctggcc 
cactgtgtcc 
gctgcagtac 
ggtgcgggtg 
gcagagactg 
ggtgctgaac 
gtacccagaa 
ccgaaggtga 
ggccgtattc 
agaaacnaaa 
ttgttgcatt 
aatttggcct 
tgttttgttt 
tttgttccct 
ttggggganc 
nanacacaaa 
tgggccngaa 
ctccccaaaa 
tctggaaatt 
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